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Rheumatoid arthritis (RA) is strongly associated with the human leukocyte antigen [HLA)- 
DRB1 locus that possesses the shared susceptibility epitope (SE) and the citrullination of 
self-antigens. We show how citrullinated aggrecan and vimentin epitopes bind to HLA- 
DRB1*04:01/04. Citrulline was accommodated within the electropositive P4 pocket of 
HLA-DRBr04:01/04, whereas the electronegative P4 pocket of the RA-resistant HLA- 
DRB1*04:02 allomorph interacted with arginine or citrulline-containing epitopes. Peptide 
elution studies revealed P4 arginine-containing peptides from HLA-DRB1*04:02, but not 
from HLA-DRB 1*04:0 1/04. Citrullination altered protease susceptibility of vimentin, 
thereby generating self-epitopes that are presented to T cells in HLA-DRB1*04:01 + indi- 
viduals. Using HLA-II tetramers, we observed citrullinated vimentin- and aggrecan-specific 
CD4 + T cells in the peripheral blood of HLA-DRBr04:01 + RA-affected and healthy indi- 
viduals. In RA patients, autoreactive T cell numbers correlated with disease activity and 
were deficient in regulatory T cells relative to healthy individuals. These findings reshape 
our understanding of the association between citrullination, the HLA-DRB1 locus, and 
T cell autoreactivity in RA. 



The human leukocyte antigen (HLA) locus plays a 
vital role in immunity; it encodes highly poly- 
morphic molecules that present peptides to 
T lymphocytes, where HLA polymorphisms 
serve to broaden the repertoire of peptides that 
different HLA allotypes can bind. ManyT cell- 
mediated autoimmune diseases are linked to 
the expression of particular HLA molecules. For 
example, certain HLA-class I allotypes are asso- 
ciated with inflammatory diseases (Bharadwaj 
et al., 2012). Moreover, strong HLA-I associa- 
tions are present with certain drug hypersensi- 
tivity reactions (Illing et al., 2012). HLA-class II 
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allele associations with autoimmune diseases are 
much more common than HLA-I associations, 
but there are few examples in which the mech- 
anism is well understood (Jones et al., 2006; 
Henderson et al, 2007). The HLA-II mole- 
cules are encoded by the highly polymorphic 
HLA-DR, DQ, and DP loci.The polymorphisms 
are found largely within the antigen-binding 
pocket of these molecules, but in HLA-DR they 
are confined to the DR(3 chain (DRB1, 3, 4, and 5 
genes) with the DRct chain being essentially 
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monomorphic. Notwithstanding some HLA disease associa- 
tions, little is known about the nature of the HLA-bound 
self-peptides that are involved in autoimmunity, limiting de- 
velopment of specific immune intervention strategies aimed to 
inhibit or prevent such deleterious immune responses. Never- 
theless, rheumatoid arthritis (RA) is arguably one of the best- 
described systems for understanding the genetic association 
between HLA-II alleles, autoimmunity, and self-peptide pre- 
sentation (Raychaudhuri et al., 2012;Viatte et al., 2013). 

RA is a systemic autoimmune diseases, afflicting ^1% of 
the population (Helmick et al., 2008). RA is characterized by 
inflammation of synovial tissues in the joints, pannus forma- 
tion, and erosion of the bones (Klareskog et al., 2009). Like 
most human autoimmune diseases, multiple genes contribute 
to RA susceptibility and severity (Viatte et al, 2013). The most 
comprehensive genetic association exists with HLA-DRB 1 
genes and in particular the HLA-DR4 alleles. Specifically, 
the association has been mapped to a highly polymorphic 
N-terminal region of the HLA DR(3 chain around positions 
70—74 (Viatte et al., 2013). This region encodes a conserved 
positively charged residue at position 71 that is thought to 
dictate the nature of the amino acid that is accommodated in 
the P4 pocket of the antigen-binding groove (Hammer et al., 
1995). Alleles having this shared conserved region of the DR(3 
70—74 region are termed to have a shared susceptibility epi- 
tope (SE; Gregersen et al., 1987) and include the commonly 
occurring HLA DRB1*04:01, *04:04, and *01:01 molecules. 
Recently, a large haplotype association study involving >5,000 
seropositive RA patients and 15,000 controls has attributed 
most of the DR-associated risk to positions 11, 13, 71, and 74 
of the HLA-DRpi polypeptide chain encoded by SE alleles 
(Raychaudhuri et al., 2012), strongly suggesting that this allo- 
type permits binding and presentation of autoantigenic pep- 
tides. In addition, HLA-DRB 1*04 + individuals had accelerated 
CD4 + T cell telomere erosion and immunosenescence com- 
mencing early in life, relative to HLA-DRB1*04~ individuals, 
regardless of the development of RA (Schonland et al., 2003). 
However, the molecular basis for the RA association with the 
SE remains unclear. 

Citrullination, the conversion of arginine to citrulline, is 
a physiological process catalyzed by peptidyl arginine deimi- 
nases (PAD;Vossenaar and vanVenrooij, 2004). This process is 
increased during inflammation, stress, and apoptosis, and ex- 
pands the repertoire of presented epitopes after protein im- 
munization (Klareskog et al., 2008). Citrullinated proteins and 
PAD (arising from inflammatory cells) are found in RA pa- 
tient synovium (Vossenaar et al., 2004; Foulquier et al., 2007) 
and in RA- and non— RA-associated pneumonia (Bongartz 
et al., 2007). Moreover, expression of citrullinated proteins is 
up-regulated in the lung epithelial cells of healthy smokers 
relative to nonsmokers (Makrygiannakis et al.,2008). Consis- 
tent with this observation, smoking increases the risk of de- 
veloping anticitrullinated protein antibody (ACPA)-positive 
RA, particularly in SE + individuals (Padyukov et al., 2004; 
Klareskog et al., 2008). Numerous citrullinated autoantigens, 
of which most are ubiquitous proteins, have been identified 
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in RA (Hill et al, 2003, 2008;Vossenaar et al, 2004;Vossenaar 
and van Venrooij, 2004; Klareskog et al., 2008; Law et al., 
2012), with some showing cross reactivity with microbial an- 
tigens (Lundberg et al., 2008). Indeed, autoantibodies specific 
for citrullinated antigens are found in the serum of RA pa- 
tients and are highly specific to the disease (van Gaalen et al., 
2004; Klareskog et al., 2008; Klareskog et al, 2009). Over the 
last decade, this observation has led to a rapid clinical transla- 
tion and adoption of ACPA reactivity as an important diagnos- 
tic tool, including the prediction of more erosive outcomes in 
RA (Klareskog et al, 2009; Klareskog et al, 2008; van Gaalen 
et al., 2004). ACPA may directly influence joint inflammation 
and erosion through local binding of citrullinated proteins 
(Kuhn et al., 2006; Harre et al, 2012). Moreover, HLA-DRB 1 
susceptibility alleles are strongly associated with ACPA-positive 
RA, strengthening the conclusion that the HLA-SE mole- 
cules restrict antigen presentation of citrullinated autoanti- 
gens (Huizinga et al, 2005; Klareskog et al., 2008, 2009; 
van Gaalen et al., 2004). However, despite the clinical utility 
of elucidating autoantibody responses toward them, the pre- 
cise role of citrullinated antigens in the initiation and/ or pro- 
gression of RA has remained elusive. 

RESULTS 

Structural basis of citrullinated epitopes presentation 

Several citrullinated (cit) epitopes, including vimentin 59 _ 71 
(GVYATR/citSSAVR/citLR/cit; Snir et al, 2011), vimen- 
tin 66 _ 78 (SAVRAR/citSSVPGVR; Hill et al, 2003; Law et al, 
2012), fibrinogen-ct 79 _ 91 (QDFTNR/citlNKLKNS; Hill et al, 
2008; Law et al, 2012), and aggrecan 84 _ 103 (WLLVATEGR/ 
CitVRVNSAYQDK; Law et al., 2012; von Delwig et al., 2010) 
are associated with ACPA + RA and the SE-encoded HLA 
alleles. To establish the basis of citrullination-dependent bind- 
ing to the SE-HLA allomorphs (Fig. 1, a and b), we deter- 
mined the high resolution structures of HLA-DRB 1*04:01 
complexed to vimentin 59 _ 71 epitopes that were citrullinated at 
position 64 (vimentin-64Cit 59 _ 71 ), as well as at positions 64, 
69, and 71 (vimentin-64-69-71Cit 59 _ 71 );the vimentin 66 ^ 78 epi- 
tope that was citrullinated at position 71 (vimentin-71Cit 66 ^ 78 ); 
and the aggrecan 89 _ 103 epitope that was citrullinated at posi- 
tions 93 and 95 (aggrecan-93-95Cit 89 _ 103 ). This provided a 
broad perspective of how citrullination of epitopes enables 
HLA-DRB 1*04:01 binding (Fig. 1; Fig. 2, a-c; and Table 1). 
The citrullinated epitopes were located within the Ag-binding 
cleft of HLA-DRB 1*04:01, and all four structures adopted 
a very similar conformation and were similar to previously 
determined HLA-DR4 structures that bound noncitrulh- 
nated antigens (Dessen et al., 1997; Fig. 1, c and d; and Fig. 2, 
a— c).The vimentin-71Cit 66 _ 78 epitope bound in a linear, ex- 
tended manner with Pl-Val, P4-Cit, P6-Ser, and P9-Gly oc- 
cupying the PI, P4, P6, and P9 pockets of HLA-DRB 1*04:01, 
respectively, whereas P2-Arg, P5-Ser, P7-Val, P8-Pro, and 
Pll-Arg represented potential TCR contact sites (Fig. 1 c). 
The P4-Cit bent back upon itself and adopted a constrained 
U-shaped conformation in which its aliphatic moiety packed 
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Amino Acid No 11 20 / 61 70 80 

DRB1*04:01 VKHECHFFNG / WNSQKDLLEQ KRAAVDTYCR 

DRB1*04:04 / R 

DRB1*04:02 / 1 — D E 




Figure 1. HLA-DRB1*04:01 in complex with Vimentin-71 Cit 66 _ 78 

(a) Polymorphic residues involved in susceptibility to RA. The peptide- 
binding groove of an HLA-DR molecule is shown in cartoon representa- 
tion with the a-chain colored in green and the p-chain colored in pink. 



against Phe26(3, Tyr78(3, and Hisl3(3 of HLA-DRB1*04:01 
(Fig. 3 a). Of the residues within the SE, positions 72 and 73 
pointed away from the P4 pocket, whereas position 74 was 
orientated toward the pocket, packing against Phe26(3, yet did 
not contact the P4-Cit.The citrullinated head group formed 
a direct H-bond to Lys71(3 NZ , the latter of which was stabi- 
lized by a salt bridge to Asp28(3, and an H-bond to P5-Ser° 
(Fig. 3 a). A P4-Arg could not be accommodated within this 
P4 pocket, as Lys71(3 would electrostatically repel the posi- 
tively charged guanidinium head group and, moreover, there 
is insufficient space surrounding the P4 pocket to enable 
Lys71(3 or P4-Arg to adopt differing conformations, which 
is consistent with the peptide elution data (discussed below). 
Although there were some sequence differences between 
the vimentin 59 _ 71 and vimenting^g epitopes, which related to 
differing anchor residue interactions at the PI (Val — > Tyr) 
and P9 pockets (Gly — ► Arg; Fig. 2 a), the P4-Cit residues 
adopted essentially identical interactions within the P4 pocket 
(Fig. 3 b). Moreover, in the vimentin-64-69-71Cit 59 ^ 71 epitope, 
the P4-Cit adopted a very similar conformation to that ob- 
served in the vimentin-64Cit 59 _ 71 epitope (Fig. 2 b and Fig. 3 c). 
While the C-terminally located Pll-Cit of vimentin-64-69- 
71Cit 59 _ 71 was solvent exposed and mobile, the P9-Cit occu- 
pied the P9 pocket of HLA-DRB 1*04:01 (Fig. 2 b). Here, 
the P9 pocket seemed equally well suited to accommodate 
P9-Arg or P9-Cit, withTyr37(3 H-bonding to both moieties 
(not depicted). The ready accommodation of P9-Arg/P9-Cit 
within the P9 pocket was consistent with the similar thermal 
stability values for HLA-DRBl*04:01-vimentin-64Cit 59 _ 71 
andHLA-DRBl*04:01-vimentin-64-69-71Cit 59 _ 71 (Tm of 
66.7°C and 69.1°C, respectively; Table 2). The structure of 
the HLA-DRBl*04:01-aggrecan-93-95Cit 89 _ 103 complex 
showed that the positioning of the P4-Cit, and the immedi- 
ate environment of the P4 pocket, was very similar to that of 
the HLA-DRBl*04:01-vimentin complexes, despite the dif- 
fering hydrogen bonding network with Lys71(3 (Fig. 2 c and 
Fig. 3 d). In the HLA-DRB 1 *04:01-aggrecan-93-95Cit 89 _ 103 
complex, the P2-Cit was highly solvent exposed (Fig. 2 c 
and Fig. 3 d), suggesting that citrullination of this position 
could potentially impact on TCR recognition. Hence, the P4 
pocket of HLA-DRB1*04:01 was highly suited to preferentially 



Residues Val 11 (J, His13(3, Lys7 1 p, and Ala74p are represented as sticks 
and correspond to the residues present in HLA-DR401, the HLA with the 
highest risk associated with RA. (b) Sequence alignment of the three HLA- 
DRB1*04 alleles used in this study showing amino acid polymorphisms. 
"-" indicates residue conserved with that of HLA-DRB1*04:01 :01. 
Vail 1 (3, His13|3 are conserved in all three alleles (not depicted), (c) HLA- 
DRB1*04:01 in complex with vimentin-71Cit 66 _ 78 . The vimentin-7lCit 66 _ 78 
peptide is bound in the peptide-binding groove, with carbons colored in 
yellow, nitrogens colored in blue, and oxygens colored in red. The a and 
(3 chains are shown in cartoon representation, and colored in green and 
pink, respectively, (d) Side view of the bound vimentin-7 1 Cit 66 _ 78 peptide. 
The peptide's 2Fo-Fc electron density map is shown in blue and contoured 
to 1 <t, showing unambiguous density for the peptide. Peptide residues 
are labeled and numbered, with Citrulline71 occupying the P4 pocket. 
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a HLA-DRB1*04:01Vimentin-64Cit sfl7 , b HLA-DRB1*04:01Vimentin-64-69-71Cit 59 71 

4 



L10 



L10 



V-1 A2 




C HLA-DRB1*04:01Aggrecan-93-95Cit 89 . 103 d HLA-DRB1*04:04Vimentin-71Cit 66 . 78 





e HLA-DRB1*04:02Vimentin-71Cit 66 . 78 f HLA-DRB1*04:02Vimentin 6 




Figure 2. Side view of epitopes bound to HLA-DR4. (a) HLA-DRBT04:01 bound to vimentin-64Cit 59 _ 71 . (b) HLA-DRB1*04:01 bound to vimentin- 
64-69-71Cit 59 _ 71 . (c) HLA-DRB1*04:01 bound to aggrecan-93-95Cit 89 _ 103 . (d) HLA-DRB1*04:04 bound to vimentin-7 1 Cit 59 _ 71 . (e) HLA-DRB1*04:02 bound 
to vimentin-7 1Cit 66 _ 78 . (f) HLA-DRB 1*04:02 bound to Vimentin 66 _ 78 . The peptide's 2Fo-Fc electron density map is shown in blue and contoured to 1 a. 
Peptide residues are labeled and numbered. 



accommodate citrulline over the corresponding Arg residue, 
with Lys7 1 (3 of the SE playing a key discriminatory role. 

HLA DRp polymorphisms and RA susceptibility 

HLA DR(3 polymorphisms are closely associated with RA 
disease susceptibility (Raychaudhuri et al., 2012;Viatte et al., 
2013). For example, although the HLA-DRB1*04:01 allele is 
strongly associated with RA susceptibility (odds ratio [OR] 
4.44), HLA-DRB1*04:08, *04:05, *04:04, and *10:(U allo- 
morphs are, by comparison, marginally less associated (OR > 
4.22), whereas allomorphs such as HLA-DRB1*04:02 and 
*13:01 are considered RA resistant/protective (OR 1.43 and 
0.59, respectively; van der Woude et al., 2010; Raychaudhuri 
et al., 2012;Viatte et al., 2013). These differing associations are 
associated with polymorphic differences mapping to positions 
11, 13, 71, and 74 (Fig. 1, a and b; Raychaudhuri et al, 2012; 
Viatte et al., 2013). To establish the differing hierarchies of 
RA disease susceptibility, we determined the structures of 
HLA-DRB1*04:04 and HLA-DRB1*04:02 in complex with 



vimentin-Cit71 66 _ 7g (Fig. 2, d and e; and Table 1). HLA- 
DRB1*04:01 differs from HLA-DRB1*04:04 by 2 aa, of 
which a K — > R polymorphism maps to position 71 Thus, the P4 
pocket remains positively charged within HLA-DRB 1*04:04, 
thereby disfavoring P4-Arg at this position. The P4-Cit of 
vimentin-7 lCit 66 _ 78 in DRB1*04:04 occupied a similar posi- 
tion to that observed in HLA-DRBl*04:01,but was in a more 
extended conformation (Fig. 3 e). Instead P4-Cit pointed to- 
ward and directly contacted Gln70(3 and Ala74(3 and H-bonded 
to Arg71(3 of HLA-DRB1*04:04, the latter of which occupied 
a very similar position to Lys71(3 (Fig. 3 e).Thus, the similar- 
ity of the P4 pockets of HLA-DRB1*04:01 and HLA- 
DRB1*04:04 provided a basis for the similar disease association 
of these allomorphs. The disease-associated effect of the poly- 
morphisms at positions 11 and 13 in the DR(3 chain is less 
clear. Position 1 1 resides within the P6 pocket, packing against 
Hisl3(3, the latter of which formed van derWaals contacts 
with the aliphatic moiety of P4-Cit. Therefore, a Hisl3(3Ser 
polymorphism, as observed in the protective HLA-DRB*13:01 
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Table 1. Data collection and refinement statistics 



DR40lVim- 

7lCit 66 _ 7a 



DR401Vim- 

64Cit 59 _ 71 



DR401Vim-64- 
69-7lCit 59 _ 7 , 



DR401Agg-93- 

95Cit 89 _ 103 



DR402Vim K 



DR402Vim- 
7lCit 66 _ 78 



DR404Vim- 

7lCit 66 _ 78 



Space group 
Cell dimensions 
a.b.c (A) 
Resolution (A) 

Total no. observations 
No. unique 
observations 

Multiplicity 
p 

merge 

R a 

1 'pirn 

<I/ctI> 

Completeness (°/o) 

Refinement Statistics 

Non-hydrogen atoms 

Protein 

Water 

Ligand 

R /R b ' c 

1 'factor/ 1 'free 

Rms deviations from 

ideality 
Bond lengths (A) 
Bond angles (°) 
Dihedrals (°) 
Ramachandran plot 
Favored regions (°/o) 
Allowed regions (°/o) 



C222, 

67.9, 1 77.8, 76.7 
62.73-2.30 (2.42- 

2.30) 
120743 (17589) 
20836 (2991) 

5.8 (5.9) 
14.7 (49.5) 
6.7 (22.3) 

9.5 (3.4) 
100 (100) 

3500 
3145 
292 
61 

17.8/22.0 



0.0049 
1.015 
14.1 

98.4 
1.6 



C222, 

67.1, 183.4,77.3 
48.84-2.41 (2.54- 

2.41) 
88651 (13122) 
18858 (2717) 

4.7 (4.8) 

1 5.8 (49.3) 

8.2 (25.4) 

7.3 (3.0) 

99.9 (100) 

3428 
3136 
239 
53 

18.9/23.1 



0.0034 
1.018 
14.3 

98.1 
1.9 



C222, 

67.2,183.6, 77.4 
91.26-2.20 (2.32- 

2.20) 
109458 (16044) 
23984 (3509) 

4.6 (4.6) 
15.5 (60) 
8 (31.2) 
7.6 (2.6) 
97.5 (98.3) 

3603 
3152 
354 
97 
17.1/20.7 



0.0075 
1.341 
14.7 

97.9 
2.1 



C222, 

67.1, 182.5, 77.5 
62.97-1.95 (2.06- 

1.95) 
222923 (18256) 
34301 (4325) 

6.5 (4.2) 

12.2 (45.7) 
5.1 (24.4) 

11.3 (2.6) 
97.5 (85.8) 

3728 
3168 
470 
90 
16.5/20.9 



0.0062 
1.059 
13.9 

98.7 
1.3 



C222, 



C222, 



C222, 



66.4, 1 82.5, 77.81 67.0, 1 82.9, 77.4 67.4, 1 83.0, 77.5 



62.43-1.70(1.79- 62.93-2.0 (2.11- 

1.70) 2.0) 

334634(49187) 228336(33111; 

52300(7557) 32616(4690) 



6.4 (6.5) 
10.5 (58.8) 
4.4 (24.3) 

12.3 (2.9) 
99.8 (99.8) 

3833 
3238 
535 
60 
16.2/18.8 



0.0046 
1.035 
15.2 

99 



7.0 (7.1) 
12.2 (49.7) 

5 (20.1) 
12.5 (3.5) 
100 (100) 

3672 
3204 

397 

71 

16.1/20.3 



0.007 
1.113 
16.1 

98.2 
1.8 



45.75-1.65(1.74- 

1.65) 
415853 (59544) 
58009 (8355) 

7.2 (7.1) 
10.0 (47.2) 
4.0 (18.9) 
10.6 (3.3) 
100 (100) 

3866 
3291 
466 
109 
16.3/18.6 



0.0053 
1.03 
14.5 

98.2 



a R, lm = 2 hkl [1/(N - DPSi I l ha ,i - <l hkl > <l hkl > 

b f>factor = E I I F 0 1 — I F c | |) / (2 |F 0 |) — for all data except as indicated in footnote c. 

c 5°/o of data were used for the R free calculation. Values in parentheses refer to the highest resolution bin. 



allomorph (Raychaudhuri et al., 2012;Viatte et al., 2013) is 
likely to impact the packing of the P4 residue. Regardless, a 
key difference between the HLA-DRB1*04:01 and HLA- 
DRB 1*04:02 allomorphs is that the latter possesses Asp70(3 
and Glu71(3, which enabled it to bind P4-Arg and P4-Cit 
(Tm of 77.1°C and 84.3°C, respectively; Table 2). Accord- 
ingly, we determined the structures of HLA-DRB1*04:02 in 
complex with Vimentin-71Cit 66 _ 78 and Vimentin 66 _ 78 (Fig. 2, 
e and f).The presence of Glu71(3, which caused a slight ad- 
justment of neighboring residues in comparison to the HLA- 
DRB 1*04:01 complex, enabled a direct H-bond and salt bridge 
to be formed with P4-Cit and P4-Arg, respectively (Fig. 4, 
a and b). In addition, Asp70(3 reoriented to form a salt bridge 
with P4-Arg. Hence, P4-Arg can be readily accommodated 
in some of the RA-protective HLA-DRB1 allomorphs due 
to the conversion toward a more electronegative P4 pocket 
(Fig. 4, c and d). 

Antigen processing and HLA-DR4 peptide repertoire 

To examine the propensity of the differentially RA-associated 
HLA-DR4 alleles to tolerate P4-Arg residues, we generated 



T2 cell lines (class II— deficient) that expressed HLA-DM and 
HLA-DRB1*04:01, *04:02, or *04:04. Accordingly, in contrast 
to previous studies on HLA-DR4-binding motifs (Hammer 
et al., 1993; Sette et al., 1993; Hammer et al, 1995), our data 
arises from a large number of novel naturally processed and 
presented peptides identified, using a common platform, from 
cells that express a single HLA-DR molecule that sample 
peptides from the same parental cell proteome. Our approach 
enabled an in-depth analysis of the repertoire of peptides 
bound to each HLA-DR allele. Over 1000 high confidence 
peptides were identified for each DR allele, elucidating HLA-II 
binding motifs for HLA-DRB 1*04:01 (« = 1058), HLA- 
DRB1*04:04 (« = 1797) and HLA-DRB 1*04:02 (« = 1239). 
These endogenous peptide sequences determined from mul- 
tiple peptide elution experiments were identified with high 
confidence using strict bioinformatic criteria that included 
the removal of common contaminants (Dudek et al., 2012). 
The motifs generated using this approach were in general 
agreement with previously determined motifs (Hammer 
et al., 1995; Sette et al., 1993), specifically exhibiting signifi- 
cantly different specificities at PI and P4 (Fig. 5 a). Namely, 
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Figure 3. Interactions with citrulline in 
the P4 pocket of HLA-DRB1*04:01 and 
HLA-DRB 1*04:04. (a) Vimentin-71 Cit 66 _ 7a 
colored in yellow, (b) vimentin-64Cit 59 _ 71 col- 
ored in pink, (c) vimentin-64-69-71Cit 59 _ 7l 
colored in green, and (d) aggrecan-93-95Cit 89 _ 103 
(colored in blue, bound to HLA-DRB1 *04:01). 
Residues from the p chain important for con- 
tacts with the P4 citrulline are represented as 
sticks, (e) Vimentin-71 Cit 66 _ 7 
bound to HLA-DRB1*04:04. 



whereas P4-Arg was absent in all the peptides bound to 
HLA-DRB1*04:01 and HLA-DRB 1*04:04, arginme for 
HLA-DRBl*04:02-bound peptides was better tolerated in 
this position (Fig. 5 a). These data are consistent with HLA- 
DRBl*04:01/04 disfavoring P4-Arg in vitro (Fig. 3) and not 
being selected at all in vivo (Fig. 5 a). In contrast HLA- 
DRB 1*04:02 has a propensity to bind P4-Arg in vitro (Fig. 4 b) 
and is permissive to P4-Arg containing peptides in vivo 
(Fig. 5 a) with 1.7% of naturally selected peptides containing 
a P4-Arg. In addition to satisfying the binding requirements 
to HLA-DRB1*04:01 and other RA-associated HLA-DR 
allotypes, we hypothesized that differential processing of 



citrullinated peptides may also contribute to their antigenic- 
ity. To establish this, we expressed recombinant vimentin and 
citrullinated it using the PAD2 enzyme. We compared in vitro 
cathepsin L digestion patterns of native and citrullinated 
vimentin and observed relative protection of the vimentin 59 _ 71 
epitope when the antigen was citrullinated at positions 64, 69 
and 71 (Fig. 5 b). Similar differences in cleavage patterns were 
observed using synthetic peptides encompassing the native 
and citrullinated vimentin 57—71 region (not shown). This sug- 
gests that citrullination not only facilitates binding of auto- 
antigenic epitopes to RA-associated HLA allotypes but that the 
modification of arginine residues also alters protease cleavage 
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Table 2. Thermostability data 



Sample 


Tm (°C) 


DR401 CLIP 


63.0 ± 0.99 


DR40lVim-64Cit 59 _ 71 


66.7 ± 1.64 


DR401Vim-64-69-71Cit 59 _ 7 , 


69.1 ± 0.58 


DR40lVim-7lCit 66 _ 78 


58.9 ± 2.17 


DR401Agg-93-95Cit 89 . l03 


64.2 ± 0.87 


DR402CLIP 


76.8 ± 1.24 


DR402Vim 66 _ 78 


77.1 ± 0.47 


DR402Vim-71Cit 66 . 7S 


84.3 ± 2.71 


DR404CLIP 


73.5 ± 0.41 


DR404Vim-71Cit 66 . 78 


83.0 ± 0.92 



patterns protecting regions of the antigen normally degraded 
in APCs.Thus, citrullination has a double-edged effect, both 
permitting SE binding and preventing degradation of post- 
translationally modified epitopes that can be presented to auto- 
reactive T cells in the context of the SE. 

Ex vivo T cell analysis using HLA DR4 tetramers 

Next, we aimed to identify circulating citrullinated epitope- 
specific CD4+T cells. We recruited 20 HLA-DRB1*04:0P 



RA patients and 6 HLA-matched healthy controls, with the 
RA patients possessing a range of disease durations, disease ac- 
tivity, and treatments (Table 3). We generated phycoerythrin 
(PE)-labeled HLA-DRB1*04:01 tetramers complexed with 
either: control influenza hemagglutinin (HA) 306 _ 318 , vimentin- 
64Cit 59 _ 71 , or aggrecan-93-95Cit 89 _ 1(13 peptides. We demon- 
strated that gating based on PE fluorescence-minus-one (FMO) 
staining reliably gates HA-specific T cells in immunized mice 
without background in saline-treated mice (unpublished 
data), and then showed specificity of the T cells using tetra- 
mers labeled with different fluorochromes (Tung et al., 2007; 
Fig. 6 a and not depicted). Although we determined the me- 
dian absolute number of CD4 + T cells to be 7 x 10 4 in healthy 
controls and 10.2 x lOVml blood in RA patients, the me- 
dian number of HA, cit-vimentin, or cit-aggrecan HLA- 
DRB1*()4:01 tetramer + cells ranged from 47 to 80/ml in RA 
patients and 30— 40/ml in healthy controls — a frequency of 
^1/2,000 CD4 + T cells. There was no significant difference 
between RA patients and healthy controls in the number of 
CD4 + or tetramer + T cells/ml (Mann- Whitney test compared 
RA patients and controls for each specificity; Fig. 6 b). 
However, the number of vimentin-64Cit 59 _ 71 (spearman r = 
0.76; P < 0.05) or aggrecan-93-95Cit 89 _ 103 tetramer + T cells 
(spearman r = 0.76; P < 0.05), but not the total number of 
CD4 + T cells, was correlated with RA disease activity score 
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Figure 4. Comparison of the interactions 
between citrulline and arginine in the P4 
pocket of HLA-DRB1*04:02. (a) Vimentin- 
71Cit 66 _ 78 colored in green; (b) Vimentin 66 _ 78 
colored in purple. The solvent-accessible elec- 
trostatic potential was calculated for panel c 
HLA-DRB1*04:01 and (d) HLA-DRB1*04:02 
bound to vimentin-71 Cit 66 _ 7a . Electrostatic 
calculations were performed using APBS 
(±12«7e). 
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Figures. HLA-DRBT04 binding motifs 
and protease sensitivity of citrullinated 
epitopes, (a) HLA Binding motifs of 
DRB1*04:01, DRB1*04:02 and DRB1*04:04 
were generated from immunoaffinity purified 
allotypes isolated from T2-DRB1*04:01, 04:02 
and 04:04 cells expressing DM. Each HLA DR 
allotype was affinity purified, and bound pep- 
tides were isolated and analyzed by Liquid 
chromatography-mass spectrometry (LC- 
MS/MS).To generate peptide-binding motifs, the 
minimal core sequences found within nested 
sets were extracted and the resulting list of 
peptides aligned and visualized using Icelogo. 
Positively associated residues (P > 0.05) at 
each relative position are shown above the 
x-axis and negatively associated residues are 
shown below. Residues height is proportion- 
ate to prevalence, with residues shown in pink 
having infinite height reflecting absolute 
presence or absence at that position in the 
bound peptides, (b) Citru I li nation alters cleav- 
age of vimentin by Cathepsin L. Recombinant 
human vimentin was citrullinated in vitro, and 
the Cathepsin L digestion patterns of native 
and citrullinated vimentin were observed by 
LC-MS/MS. Observed cleavages are high- 
lighted by arrows in the region of vimentin- 
spanning residues 51-81. The amount of 
selected peptides (as determined by area 
under the curve quantitation for extracted 
ion chromatograms) from this region that span 
the immunogenic 59-71 region of vimentin 
are shown as a function of digestion time 
(1, 5, 3, and 60 min digests). 



Time (min) 



(DAS4vCRP; Fig. 6 c).Vimentin-64Cit 59 _ 71 and aggrecan-93- 
95Cit g9 ^ 103 mean fluorescence staining intensity (MFI) was 
significantly lower in RA patients than healthy individuals 
(Fig. 6 d).Tetramer staining intensity of CD4 + T cells in type 1 
diabetes patients reflected avidity of the TCR for pHLA, 
with high avidity being associated with sensitivity to apop- 
tosis and regulatory function (Mallone et al., 2005). The re- 
duced MFI of cit-vimentin and cit-aggrecan-specific T cells 
in RA patients thus suggested an altered balance in regulatory 



and effector-memory T cells. Human PB CD4 + T cells can 
be subdivided into resting CD45RA + Foxp3 + CD25 + and 
activated CD45RA~Foxp3 + CD25 hl suppressive populations 
(regulatory T [T reg] cells), and Foxp3 lo CD45RA~ and 
Foxp3~CD45RA + and CD45RA~ nonsuppressive popula- 
tions, which each have potential for proinflammatory cyto- 
kine production upon stimulation ex vivo (Miyara et al., 2009, 
2011). We stained PBMCs similarly, substituting CD45RO, 
which identifies a reciprocal population to CD45RA. Among 
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Table 3. Characteristics of HLA-DRBr04:01 + RA patients 


used for optimization, enumeration anc 


1 phenotypic studies 


Demographic and clinical details 


KA patients in = 2UJ 


Age, mean (SD) 


58 (13.7) 


Female sex, n (%) 


15 (75) 


Disease duration (y), median (i.q.r) 


2 (1.25-5.75) 


ACPA+, n (%) 


15 (75) 


Current smokers, n (°/o) 


8 (40) 


Ever smokers, n (°/o) 


17 (85) 


Disease activity (DAS4v-CRP), median (i.q.r) 


2.38 (1.58-3.0) 


Treatment 




Methotrexate, n (°/o) 


13 (65) 


Hydroxychloroquine, n (°/o) 


13 (65) 


Sulfasalazine, n (°/o) 


8 (40) 


Low dose prednisone, n (%) 


3(15) 


Leflunomide, n (°/o) 


2 (10) 


Multiple antirheumatic drugs, n (°/o) 


14 (70) 


No treatment (°/o) 


1 (5) 



the total CD4 + cells in PB of HLA-DRBl*04:0i + RA pa- 
tients relative to healthy control donors, the proportion of 
resting (Fig. 6 e) and activated (Fig. 6 f) T reg cells was signifi- 
cantly reduced and the proportion of FoxP3~ effector/memory 
(Fig. 6 h) cells tended to be increased (P = 0.05).Vinientin- 
64Cit 59 _ 71 and, in most cases, aggrecan-93-95Cit 89 _ 10 3— specific 
T cells were significantly less likely to be resting (Fig. 6 e) or 
activated (Fig. 6 f) T reg cells and significantly more likely to 
have a FoxP3~ CD45RCT naive (Fig. 6 g) or CD45RO+ ef- 
fector memory (Fig. 6 h) phenotype in RA than healthy con- 
trol PBs. HA-specific T cells were also significantly more 
likely to have an effector memory phenotype in RA than 
healthy control PB (Fig. 6 h). These data indicate that the 
HLA-DRB1*0401 SE permits the selection and/ or periph- 
eral expansion of low numbers of CD4 + T cell populations 
specific for vimentin-64Cit 59 _ 71 and aggrecan-93-95Cit 89 _ 103 
self-antigens unrelated to a history of RA. This is consistent 
with recent findings in healthy HLA-DRB1*0401 + individu- 
als, where self-antigen— specific CD4 + T cells were observed 
in preenriched samples, despite the donors not suffering from 
autoimmune disease (Su et al., 20 13). The enrichment in naive 
and effector/ memory T cells and paucity ofT reg cells among 
antigen-specific CD4 + T cells, further indicates that T cell 
regulatory capacity is deficient among CD4 + T cells, includ- 
ing autoreactive CD4+ T cells in HLA-DRB1*0401 + RA 
patients relative to HLA-DRB1*0401 + healthy controls. 

DISCUSSION 

Given the central role of posttranslational modifications of 
proteins in regulating essential physiological processes, sur- 
prisingly little is known regarding the molecular basis un- 
derlying their impact on immunity (Petersen et al., 2009). 
Nevertheless, some recent advances, particularly in the area of 
T cell— mediated autoimmunity, have demonstrated the capac- 
ity of T cells to recognize HLA-restricted posttranslationally 



modified antigens. For such reactivity, an individuals immuno- 
genetics and the antigens themselves are closely associated 
with the pathogenesis of diseases, including type 1 diabetes 
(Mannering et al., 2005), celiac disease (Abadie et al., 2011; 
Broughton et al, 2012), and RA (Law et al, 2012; Smr et al, 
2011; von Delwig et al, 2010). 

The association between the HLA-DRB1 locus and RA 
has been known for over 25 yr, leading to the shared epitope 
hypothesis (Gregersen et al., 1987). Further, there is a clear as- 
sociation between these shared-epitope alleles and citrullina- 
tion, where several citrulHnated epitopes are identified in RA 
patients. Moreover, a large haplotype association study attrib- 
uted most of the HLA-DR— associated risk to positions 11, 13, 
71, and 74 of the HLA DR(3 polypeptide chain in RA, strongly 
suggesting that this allotype permits binding and presentation 
of autoantigenic peptides (Raychaudhuri et al., 2012). Our 
findings provide a comprehensive structural portrait of the as- 
sociation between RA., HLA-DRB1 , and citrulHnated peptides. 
Namely, we describe seven, high resolution, crystal structures 
of HLA DR4-Ag complexes of direct relevance to RA.. We 
show how four RA-associated citrullinated epitopes (vimentin- 
64Cit 59 _ 71 , vimentin-64-69-7 1 Cit 59 _ 71 , vimentin-7 1 Cit 66 _ 78 , and 
aggrecan-93-95Cit 8<M03 ) bound to HLA-DRB1*04:01, an 
allele that is strongly associated with RA susceptibility. These 
four structures show that the mode of binding of the P4- 
Citrulline residue within the P4 pocket of HLA-DRB 1*04:01 
is conserved, in which P4-Cit contacted positions 13 and 71 of 
the SE motif These structures provided a clear and general ex- 
planation as to (a) why P4-Arg could not be accommodated 
within the P4 pocket of HLA-DPvBl*04:01 and (b) how 
P4-Cit was accommodated within the electropositive P4 
pocket of HLA-DRB1*04:01. Namely, the P4 pocket of 
HLA-DRJ31*04:01 is highly suited to preferentially accom- 
modate citrulline over the corresponding Arg, with Lys71(3 of 
the SE playing a key discriminatory role. 

Next, we examined how HLA DR(3 polymorphisms can 
impact RA hierarchies of disease susceptibility. First, we de- 
termined the structure of HLA-DRB 1*04:04 in complex 
with vimentin-Cit71 66 _ 78 .The P4 pocket remained positively 
charged within HLA-DRB1*04:04, thereby providing a basis 
for the similar disease association of these allomorphs. Second, 
we determined the structures of the RA resistance allele, HLA- 
DRB1*04:02, in complex with Vimentin-7 lCit 66 _ 78 and 
Vimentin 66 _ 78 .The presence of Glu71|3 in HLA-DRB 1*04: 02 
enabled contacts to be formed with P4-Cit and P4-Arg. Hence, 
P4-Arg was readily accommodated in the PvA-protective HLA- 
DRJ31*04:02 allomorphs because of the conversion of an elec- 
tropositive to an electronegative P4 pocket. 

Our findings indicate that the hierarchy of disease associa- 
tion is linked to the decreasing exclusivity of the HLA-DR4 
molecules to bind citrulline, with RA-resistance alleles being 
able to bind both arginine and citrulline residues. Indeed, in 
the naturally processed and presented peptides bound to dif- 
ferent HLA-DR4 alleles, P4-Arg was only tolerated in the 
RA-resistant HLA-DRB1*04:02 allele. Citrullinated self- 
antigen-specific CD4 + T cells, were present in low but similar 
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Figure 6. CD4 + tetramer + T cells circu- 
late in RA patients and healthy controls. 

(a) PBMC from a representative HLA- 
DRB1*04:01 + RA patient were stained with 
PE-labeled HLA-DRB1*04:01-HA 306 _ 318 tetra- 
merand Brilliant Violet 421 (BV)-labeled HLA- 
DRBl*04:01-vimentin-64Cit 59 _ 71 tetramer and 
FITC-labeled anti-CDI 1c, CD 14, CD 16, and 
CD19 and APC/Cy7-labeled anti-CD4, and 
then analyzed by flow cytometry, setting 
gates on FITC~CD4 + cells based on PE- and 
BV-fluorescence minus one (FM0) staining. 
PBMCs from 9 HLA-DRB1*04:01 + RA patients 
(RA, filled circles) and 4 healthy controls 
(HC, empty circles) were stained with AQUA 
live dead discriminator, PE-labeled HLA- 



DRB1*04:01-HA,, 



-vimentin-64Citr, 



or -aggrecan-93-95Cit a9 _ 103 tetramers, Alexa 
Fluor 488 Foxp3, PerCP/Cy5.5-CD14, Pacific 
blue-CD45R0, APC-CD28, APC/Cy7-CD4, 
PE/Cy7-CD25, and count beads were added. 
Either CD4 + CD14-f,etramer + or total CD4 + 
T cells were gated and the frequency of 
CD4+CD14-tetramer + or total CD4+CD14- 
cells/ml blood was calculated (b). The number 
of HLA-DRB1*04:01-vimentin-64Cit 59 _ 71 - 
specific T cells was plotted relative to the four 
variable disease activity score (DAS4vCRP) of 
each RA patient (c). MFI of tetramer T cells (d), 
percentage of CD4 + CD45RCTCD25 + Foxp3 + 
resting T reg cells (e), CD4 + CD45R0 + CD25 hi 
Foxp3+ activated T reg cells (f), CD4+CD45R0- 
CD25~Foxp3- naive T cells (g) or CD4+CD45R0+ 
CD25~Foxp3~ effector/memory T cells (h) 
of RA patients (columns 1, 3, 5, and 7) and 
healthy controls (columns 2, 4, 6, and 8) is 
shown for each or tetramer* population (col- 
umns 1-6) and for total PB CD4+T cells (col- 
umns 7 and 8).*, P < 0.05;**, P < 0.01, using 
the Mann-Whitney test to compare RA pa- 
tients and healthy controls. The number of 
PB HLA-DRBl*04:01-virnentin-64Cit 59 _ 71 - 
specific T cells were correlated with disease 
activity score in RA patients (Spearman r = 
0.76; P < 0.05). 



numbers in the CD4 + T cell peripheral blood repertoire of 
HLA-DRB1*04:01 + RA patients and healthy controls. Our 
data imply that the exclusion of P4-Arg and acceptance of 
P4-Cit by HLA-DRB1*04:01 leads to the presentation of 
peptides that can interact with the corresponding autoreac- 
tive T cell repertoire to increase selection and/ or expansion 
of autoreactive CD4 + T cells. T cells of highest self-reactivity 
escaping the affinity threshold for deletion in the thymus are 
found among the natural T reg cell population (Hsieh et al., 
2004). Indeed, the autoreactiveT cells in HLA-DRB1*04:01 + 
healthy controls were enriched in resting and activated Foxp3 + 
T reg cells, and MFI reflecting tetramer binding avidity was 
higher in healthy controls than in RA patients, whose auto- 
reactive T cells were relatively deficient in T reg cells. The 



correlation between antigen-specific T cell frequency and 
RA disease activity suggests disproportionate peripheral ex- 
pansion or survival of effector/ memory cells relative to T reg 
cells in RA patients, potentially due to antigen presenting cell 
activation or IL-2 availability, on which RA genetic back- 
ground and inflammation impact (Li et al., 2013; Pettit et al., 
2000;Viatte et al., 2013). These data are consistent with pre- 
viously reported expansion of total CD4 + CD28~ T cells in 
RA PB, correlated with disease activity (Scarsi et al., 2010; 
Sempere-Ortells et al., 2009). Further, the higher proportion 
of FoxP3~ autoreactive effector/ memory T cells in RA pa- 
tients indicates higher cytokine production potential in re- 
sponse to presentation of citrullinated autoantigens at sites of 
inflammation including the lung of smokers and RA joints, 
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which has been demonstrated ex vivo (Law et al., 2012; 
Makrygiannakis et al., 2008; Wegner et al., 2010). This recog- 
nition is amplified by the protection of regions of citrullinated 
antigens from proteolysis, thereby promoting the presentation 
of citrullinated self-epitopes to autoreactive T cells. Collec- 
tively, our findings have reshaped our understanding of the 
association between citrullination, the HLA-DRB1 locus, 
autoreactive T cells, and their regulation in RA. 

MATERIALS AND METHODS 

Mammalian expression vector construction. The extracellular domains 
of the HLA-DR4 (DRA*01:01/DRB1*04:01, *04:02, and *04:04) a and (3 
chains were cloned into the pHLsec (Aricescu et al., 2006) vector for expres- 
sion in HEK 293S (GiiTI - ) cells (Reeves et al., 2002). Constructs contained 
C-terniinal enterokinase cleavable fos/jun zippers to promote dimerization. 
The P chain also contained a BirA site for biotinylation and tetramer genera- 
tion and a Histidine tag for IMAC purification. HLA-DR4 was expressed 
with the class II— associated invariant chain peptide (CLIP) covalently at- 
tached via a Factor Xa cleavable flexible linker to the N terminus of the 
P chain and is preceded by a Strep-II tag (IBA; Gottingen) for purification. 

Expression and purification. The HLA-DR4CLIP construct was tran- 
siently expressed in HEK 293S (GnTI - ) cells and soluble protein was puri- 
fied from the culture medium. In brief, culture medium was concentrated 
and buffer exchanged via the Cogent Ml TFF system (Merck Millipore) into 
10 mMTris, pH 8.0, and 500 mM NaCl.The proteins were then purified 
using IMAC via Ni Sepharose 6 Fast Flow (GE Healthcare) and size exclu- 
sion chromatography (Superdex 200; GE Healthcare) in 10 mMTris, pH 8.0, 
150 mM NaCl. HLA-DR4CLIP was cleaved with Factor Xa for 6 h at 21°C 
before peptide exchange. HLA-DR4 was subsequently loaded with test pep- 
tides by incubating for 16 h at 37°C in a 20-fold excess of peptide in 100 mM 
sodium citrate pH 5.4 in the presence of HLA-DM at a HLA-DR4:DM 
ratio of 5:1. Peptides were sourced from GL Biochem at a purity of >95%. 
The aggrecan-93-95Cit 89 _ 103 peptide was modified with a glycine to tyrosine 
mutation at position 92, to stabilize the HLA-DRB1*04:01— aggrecan-93- 
95Cit 89 _ 103 complex for structural and tetramer studies. Peptide-loaded HLA- 
DR4 was then purified from HLA-DR4CLIP using Strep-Tactin Sepharose 
(IBA; Gottingen). The unbound protein was concentrated and buffer ex- 
changed into 25 mMTris, pH 7.6, and 50 mM NaCl, followed by removal of 
the fos/jun zipper by cleavage with enterokinase (GenScript) for 16 h at 
21°C. Enterokinase-cleaved, peptide-loaded HLA-DR4 was then purified 
further via anion exchange chromatography (HiTrap Q HP; GE Healthcare), 
then buffer exchanged into 10 mM Tris-HCl, pH 8.0, 150 mM NaCl and 
concentrated to 6 mg/ml for crystallization. 

Thermal stability assays. Thermal stability assays of HLA-DR4 peptide 
complexes were performed using a Real-Time Detector instrument (Cor- 
bett RotorGene 3000). In brief, HLA-DR4 peptide complexes were pre- 
pared at either 10 or 20 uM in 10 mM Tris, pH 8.0, 150 mM NaCl. SYPRO 
orange (Invitrogen) was added to monitor unfolding, samples were heated 
from 30°C to 95°C at l°C/min and the change in fluorescence intensity was 
recorded at excitation and emission wavelengths of 530 and 555 nm, respec- 
tively (Table 2). 

Crystallization and structure determination. Crystal trays were set up 
using the hanging-drop vapor diffusion method at 20°C. Protein and a 
mother liquor of 100 mM BTP,pH 7.3,22-28% (vol/vol) PEG3350,and 0.2 M 
KN0 3 were mixed at a 1:1 ratio. Platelike crystals typically grew within 
5 d. Crystals were flash frozen in 16% (vol/vol) ethylene glycol before data 
collection. Data were collected at the MX1 or MX2 beamlines at the Austra- 
lian Synchrotron and processed using the program Mosflm.The structures 
were determined by molecular replacement using the program Phaser 
and subsequently refined using Phenix and iterations of manual refinement 
using Coot (Table l).The structures were validated using MOLPROBITY. 



Human subjects. 20 patients who fulfilled the 1987 American College of 
Rheumatology (ACR) criteria for RA (Aletaha et al., 2010) and 6 ACPA" 
SE + healthy controls were included. All individuals provided peripheral 
blood (PB) samples, although the yield was insufficient for all assays in some 
cases. Patient demographic details are outlined in Table 3. Disease activity 
scores (DAS4vCRP) were determined on the day of blood sampling for the 
study. HLA-DR genotyping was performed at Queensland Health Pathol- 
ogy Services. The study was approved by the Metro South and University of 
Queensland Human Research Ethics Committee, and informed consent was 
obtained from each individual. 

Tetramer generation. HLA-DR4 peptide samples were buffer exchanged 
into 10 mM Tris, pH 8.0, and biotinylated as described previously (Broughton 
et al., 2012). The percentage of biotinylation was determined by native gel 
electrophoresis and complexation with avidin. Tetramers were generated by 
the addition of streptavidin-PE (BD) or strep tavidin-Brilliant Violet (BV421; 
BioLegend) in an 8:1 molar ratio. 

Tetramer staining. Initial staining optimization was required as cells were 
rare, and HLA-DR4 tetramer staining intensity was low PBMCs from HLA- 
DRB1*04:01 + RA patients and healthy controls were thawed from frozen 
aliquots, stained with 4.2 u.g/ml PE-labeled tetramers; aqua live-dead dis- 
criminator (Invitrogen); FITC-labeled anti-CDllc, -CD14, -CD16, and 
-CD19; and APC/Cy7-labeled anti-CD4 in the presence of 50 nM dasatinib 
(Selleckchem). Live CD4 + T cells were gated and non— T cell lineage" 1 " cells 
were excluded, and then enriched with anti-PE immunomagnetic beads 
(MACS; Miltenyi Biotec).The HLA-DR4 tetramer gate was set for CD4 + 
T cells based on PE fluorescence minus one (FMO) staining. Inclusion of 
50 nM dasatinib (Selleckchem) during staining markedly increased the de- 
tection of tetramer + T cells. Whereas immunomagnetic enrichment with 
anti-PE-beads (MACS; Miltenyi Biotec) after staining reduced the number 
of cells required for acquisition by the flow cytometer, it underestimated the 
frequency and skewed the phenotype of tetramer 4 " T cells. Following these 
optimization experiments, immunomagnetic enrichment was not used; each 
sample of PBMC was divided into three, and each stained with one PE- 
labeled tetramer, aqua live-dead, anti-CD14-PerCP/Cy5.5, anti-CD4-APC/ 
Cy7, anti-CD45RO-Pacific blue, anti-CD25-PE/Cy7, anti-Foxp3-Alexa 
Fluor 488, and anti-CD28-APC (BioLegend and BD), and then analyzed 
using a Gallios flow cytometer and Kaluza software (Beckman Coulter). 
HLA-DR4 tetramer gating based on PE FMO staining was kept constant for 
the entire study. The frequency of CD4 + CD14~ tetramer" 1 " cells/ml blood 
was calculated based on cell number determined by addition ofTruCOUNT 
beads (BD). 

Mice and immunization. I-A b_/_ C57BL/6 mice expressing a chimeric 
class II transgene containing the aipi domains of human DRA1*0101- 
Bl*04:01 on a mouse IE d backbone (DR04:01-IE mice) were obtained from 
Taconic and bred and housed under specific pathogen— free conditions at 
University of Queensland. Experiments were approved by the University of 
Queensland Animal Ethics Committee. Draining lymph nodes of mice im- 
munized with 1 jag Fluvax 2012 (CSL) or saline-treated mice were removed 
4 wk later and stained with 7-AAD, FITC-labeled anti-CDllc, CD14, CD16, 
and CD19, CD4-APC, and PE-labeled DRB1*04:01-HA 306 _ 318 tetramer 111 
the presence of dasatinib. Cells were analyzed using a Gallios flow cytometer 
and Kaluza software. Live CD4 + T cells were gated and non— T cell lineage + 
cells were excluded. Gates for the pHLA-II tetramer staining were set based 
on PE FMO staining. 

Statistical analysis. The Kruskal-Wallis test with Dunns' Multiple Com- 
parison Test compared multiple means. Significance is indicated as *, P < 0.05; 
**, P < 0.01; ***, P < 0.0001. All error bars represent SEM. 

Preparation and digestion of citrullinated vimentin. Recombinant 
human vimentin was generated as an N-terminal hexahistidine fusion 
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protein in S£9 insect cells using the Bac-to-Bac Baculovirus Expression Sys- 
tem (Invitrogen). AfFinity-purified protein was incubated with rabbit skeletal 
PAD (Sigma-Aldrich) for 16 h at 37°C in 0.1 M Tris-HCl (pH 7.6) contain- 
ing 10 niM CaCl 2 and 5 mM dithiothreitol. Citrullinated and control pro- 
teins were further purified using Nickel Sepharose (GE). Human cathepsin L 
was expressed in Pichia pastoris (system donated by D. Bromine, University 
of British Columbia, Vancouver, Canada), purified and titrated using E-64 
as described previously (Bromine et al., 1999). Cathepsin L was preactivated 
by incubation in 0.1 M acetate, 1 mM EDTA, and 10 mM cysteine, pH 5.0 
for 30 min at room temperature, and 2 nmol cathepsin L was used to digest 
the vimentin proteins at pH 5.0. At the indicated times, samples were acidi- 
fied and desalted using a C18 Zip-tip. Samples were eluted with 50% (vol/vol) 
acetonitrile/0.1% (vol/ vol) formic acid, concentrated and separated on an 
Eksigent Ultra cHiPLC system using a gradient of 5— 80% (vol/vol) Acetoni- 
trile for 90 min, and analyzed online using an AB SCIEX 5600+ TripleTOF 
high resolution mass spectrometer. 

Repertoire analysis of HLA-DR4 allomorphs. T2-DRB1*04:01, *04:02, 
and *04:04 cells expressing DM were generated via retroviral transduction 
of the parental T2 line as previously described (Pang et al., 2010). Cells 
were expanded in RPMI-10% FCS and pellets of 10 9 cells snap frozen in 
liquid nitrogen. Cells were ground under cryogenic conditions and resus- 
pended in lysis buffer (0.5% IGEPAL, 50 mM Tris, pH 8.0, 150 mM NaCl 
and protease inhibitors) as previously described (Dudek et al., 2012; Illing 
et al., 2012). Cleared lysates were passed over a protein A precolumn fol- 
lowed by an affinity column cross-linked with a monoclonal antibody spe- 
cific for HLA-DR (LB3.1). Peptide— MHC complexes were eluted from 
the column by acidification with 10% (vol/vol) acetic acid. Peptides were 
isolated using reversed-phase HPLC (Chromolith C18 Speed Rod; Merck) 
on an Akta Ettan HPLC system (GE HealthCare). Fractions were concen- 
trated and analyzed using an AB SCIEX 5600+ TripleTOF high-resolution 
mass spectrometer as previously described (Dudek et al., 2012). Acquired 
data were searched against the human proteome (Uniprot/Swissprot v2012_7) 
using ProteinPilot software v 4.5 (AB SCIEX). The resulting peptide identi- 
ties were subject to strict bioinformatic criteria, including the use of a 
decoy database to calculate the false discovery rate (FDR). A 5% FDR cut- 
off was applied and the filtered dataset was further analyzed manually to 
exclude redundant peptides and known contaminants. To generate motifs, 
the minimal core sequences found within nested sets were extracted and 
the resulting list of peptides were aligned using MEME (http://meme 
.nbcr.net/meme/), where motif width was set to 9—15 and motif distribu- 
tion set to 'one per sequence' (Bailey et al., 2009). Peptides derived from 
HLA or immunoglobulin molecules were not included in the final motif 
analysis. Motifs were submitted to Icelogo for visualization using the fre- 
quencies of amino acids in the human proteome as a reference set (Colaert 
et al., 2009). 
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